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Chapter 5 Manifolds 


We are now going to begin our study of calculus on curved spaces. Everything we have 
done up to this point has been concerned with what one might call the flat Euclidean spaces 
R”. The objects that we shall now be investigating are called manifolds. Each of them will 
have a certain dimension m. This is a positive integer that tells how many independent 
“coordinates” are needed to describe the manifold, at least locally. For instance, the surface 
of the earth is frequently modeled as a sphere, a 2-dimensional manifold, with points located 
in terms of the two quantities latitude and longitude. (This description clearly holds only 
locally — for instance, the north pole is described in terms of latitude = 90° and longitude is 
undefined there. Further, longitude ranges between —180° and 180°, so there’s a discontinuity 
if one tries to coordinatize the entire sphere.) 

We shall thus be concerned with m-dimensional manifolds MM which are themselves subsets 
of the n-dimensional Euclidean space R”. In almost all cases we consider, m = 1, 2,..., or 
n—1. There is a case m = 0, but these “manifolds” are zero dimensional and thus are just 
made up of isolated points. The case m = n is actually of some interest; however, a manifold 
M CR" of dimension n is just an open set in R” and is therefore essentially flat. MZ and R” 
are locally the same in this case. 

When M Cc R” we say that R” is the ambient space in which M lies. 

We usually call 1-dimensional manifolds “curves,” and 2-dimensional manifolds “surfaces.” 
But we shall generically use the neutral word “manifold.” 























A. Hypermanifolds 








Assume R” is the ambient space, and M C R” the manifold. Given that we are not very 
interested in the case of n-dimensional M, we distinguish manifolds which have the maximal 
dimension n — 1 and we call them hypermanifolds. 


IMPLICIT DESCRIPTION. Suppose R” —> R is a function which is of class C!. We 
have already thought about its level sets, sets of the form 




















M = tel) e@e RR”, g(a) = ch, 





where c is a constant; see p. 2-42. The fundamental thinking here is that in R” there are 
n independent coordinates; the restriction g(x1,...,2%,) = c removes one degree of freedom, 
so that points of M can locally be described in terms of only n — 1 coordinates. Thus we 
anticipate that M is a manifold of dimension n — 1, a hypermanifold. 

A very nice example of a hypermanifold is the unit sphere in R”: 








S(0,1) = {e|a# ER", |l2|| = 1}. 
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There is a very important restriction we impose on this situation. It is motivated by our 
recognition from p. 2-43 that Vg(a) is a vector which should be orthogonal to MW at the point 
x € M. If M is truly (n—1)-dimensional, then this vector Vg(x) should probably be nonzero. 
For this reason we impose the restriction that 


forall ce M, Vog(x) £0. 


Notice how the unit sphere $(0,1) fits in this scene. If we take g(x) = ||z||?, then Vg(x) = 
2x. And this vector is not zero for points in the manifold; the fact that Vg(0) = 0 is irrelevant, 


as 0 is not a point of the manifold. We could also use g(x) = ||z||, for which Vg(x) = 2||2||~1 
is never 0. The fact that g is not differentiable at the origin is irrelevant, as the restriction 
||z|| = 1 excludes the origin. 











We now present four examples in R? showing what sort of things can go wrong in the 
absence of this assumption. In all these cases we write g = g(x,y), c= 0, and the “bad” point 
is located at the origin, so Vg(0,0) = 0. 





EXAMPLE 1. x?—y*?=0: 








EXAMPLE 2. 2724+ 2?-y?=0: 


Vg = (2x + 3x”, —2y). 








EXAMPLE 3. —2?+2?-y? =0: 


Vg = (—2a +32", —2y). 








Manifolds 3 


EXAMPLE 4. x*—y?=0: 








PROBLEM 5-1. This problem generalizes the situation of the sphere S(0, 1) described 
above. Let A be an n x n real symmetric matrix. Suppose that the set M = {x | xe 
R”", Axe x = 1} is not empty. Then M is called a quadric in R”. Prove that it is a 
hypermanifold. That is, prove that for alla € M V(Arexz) 40. 


PROBLEM 5-2. Manifolds do not necessarily have to be “curved” at all. Thus 
suppose h € R” is not zero, and suppose c € R is fixed. Prove that {x | « € R", hex = c} 
is a hypermanifold. 


PROBLEM 5-3. Suppose the set 
M={a¢|c ER", Arer+her=c} 


is not empty. (Here A is an n x n real symmetric matrix, h € R” is not 0, c is a real 
number.) Assume that A is invertible and that A~'heh #4 —4c. Prove that M is a 
hypermanifold. 


PROBLEM 5-4. — Continuing with the preceding situation, assume A is the identity 
matrix. Prove that 


M ={x|reER", ||2||? + hex = —|All’/4} 





is not a hypermanifold. 


B. Intrinsic gradient —- warm up 





In this section the scenario is that of a hypermanifold M Cc R", where M is described 
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implicitly by the level set 


(We can clearly modify g by subtracting a constant from it in order to make M the zero-level 
set of g.) We assume Vg(xz) 4 0 for 2 € M. 


We are definitely thinking that Vg(x) represents a vector at « € M which is orthogonal 
to M. We haven’t actually defined this notion yet, but we shall do so in Section F when we 
talk about the tangent space to M at x. We’ve discussed this orthogonality on p. 2-43, and 
it is to our benefit to keep this geometry in mind. 





A recurring theme in this chapter is the understanding of the calculus of a function LR. 
For such a function we do not have the luxury of knowing that f is defined in the ambient 
space R”. As a result, we cannot really talk about partial derivatives Of /Ox;. These are 
essentially meaningless. 








For instance, consider the unit circle 2? + y? = 1 in R? and the function f defined only on 
the unit circle by f(x,y) = 2? — y. We cannot really say that Of /Ox = 2x. For we could also 
use the formula f(x,y) = 1—y? —y for f, which might lead us to believe 0f/Ox = 0. And 
anyway, the notation Of /Ox asks us to “hold y fixed and differentiate with respect to x,” but 
holding y fixed on the unit circle doesn’t allow x to vary at all. 


Nevertheless, we very much want to have a calculus for functions defined on M, and at the 














very least we want to be able to define the gradient of M 4, Rin asensible way. We shall not 
succeed in accomplishing this right away. In fact, this entire chapter is concerned with giving 
such a definition, and we shall finish it only when we reach Section F. 


However, we want to handle a very interesting special case right away. Namely, we assume 


Re 4 R, so that f is actually defined in the ambient R", and we focus attention on a 


fixed  € M. As such, Vf(x) € R” exists, but this is not what we are interested in. We 
actually want a vector like V f(x) but which is also tangent to M. That is, according to our 
expectations, which is orthogonal to Vg(x). Here’s a schematic view: 
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Vi (x) 


V f (x) 


Veo) 


What we can do is orthogonally project V f(x) onto the tangent space to M at x, by the 
simple device of subtracting from V f(x) the unique scalar multiple of Vg(x) which makes the 
resulting vector orthogonal to V g(a): 


(Vf (2) — eVg(a)) ¢ Vg(a) = 0. 


Thus 
V f(x) ¢ Vg(2) 
IVg(x)I? 
You should recognize this procedure; it’s exactly the theme of Problem 1-12 (see also pp. 
1-14 and 15). 





C= 


DEFINITION. The intrinsic gradient of f in the above situation is the vector 


V(x) ¢ Vg(2) 
I|Vg(a) ||? 


An immediate example of interest is the defining function g itself. It clearly satisfies 





Vf(z) = Vf(z) - Vg(2). 


Vg(x) = 0. 




















PROBLEM 5-5. Prove in fact that if R > R is differentiable, then ¥(yog)(x) = 0. 


6 Chapter 5 


PROBLEM 5-6. More generally, prove this form of the chain rule: 


V(po f(z) = o'(F()) VF (2). 


Our notation W displays the difference between the intrinsic gradient and the ambient 
gradient V. However, it fails to denote which manifold is under consideration. The intrinsic 
gradient definitely depends on M. Of course, it must depend on M if for no other reason 
than we are computing Wf(zx) only if x belongs to M. We illustrate this dependence with the 
simple 





EXAMPLE. Let M be the sphere S(0,7r) in R”. Then we may use g(x) = ||x||? (or ||2||?—r?), 
so that Vg = 2x and we obtain the result 


_ re Vio) 


r2 


Vf(x) = VfF(@) 


PROBLEM 5-7. Let f(x,y) = 2? — y and calculate for the manifold 2? + y? = 1 
Vf = x(2y + 1)(y, -2). 


Repeat. the exercise for the function h(x, y) = 1— y? — y, and note that Wf = Wh. 


PROBLEM 5-8. The intrinsic gradient in the preceding problem is zero at which four 
points of the unit circle? Describe the nature of each of these “intrinsic critical points” 
(local maximum, local minimum, saddle point’). Repeat the entire discussion for the 
function obtained by using the polar angle parameter: 


f (cos, sin 0) = h(cos@,sin 8) = cos? 6 — sin 0. 


That is, use single variable calculus and the usual first and second deriative with respect 
to 0. 





PROBLEM 5-9. Let 1 <i <7 be fixed and let M be the hyperplane {x € R” | x; = 
0}. Calculate wf for this manifold. 
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PROBLEM 5-10. — Show that in general 
IVf(z)|| = [MV F(2)|| sin 8, 


where @ is the angle between the vectors V f(x) and Vg(z). 
PROBLEM 5-11. Prove the product rule: 


V(fh) = fwvh+hrft. 


Vx, - 
; 





EXAMPLE. Let f(x) = and let M be the unit sphere in R”. Then 





j 
/ Wey 





Vi(xz) = €:-—(reé\)x 
= ey ime On I Or ad 2 
= (1 — x}, -—21%9,..., 112). 


Notice that Wf is quite a bit more complicated in form than Vf. 
Also, whereas Vx, = €, is never 0, it is clear that Wx; = 0 is a real possibility. In fact, it 
is zero at the two points +é€,. These happen to be the two points of the sphere where the 
function x, attains its extreme values. 

Another way of phrasing the definition of the intrinsic gradient is to let N denote a unit 
vector at x which is orthogonal to M. (What this means precisely will be discussed later.) 
Then we expect that Vg(x) = cN for some real c ¥ 0, so that 


V f(x) =Vf(x) —Vf(x) e NN. 











PROBLEM 5-12. Let M be a 1-dimensional manifold (a “curve”) in R?. Let T 
be a unit vector tangent to M at x, and prove that Wf(z) is related to the directional 
derivative of f by the formula 





Vi(x) = Df(«;TYT. 
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PROBLEM 5-13. = Suppose R” 4 Ris homogeneous of degree a. For the manifold 
which is the sphere S(0,r) of radius r show that 
vravp—- 22. 


r2 





Another extremely important property is the fact that wf is truly intrinsic: it depends 
only on the knowledge of f when restricted to the manifold. This is not quite clear at the 
present stage of our development because we have employed the ambient gradient Vf in our 
definition. However, the following argument should serve to make it intuitively clear. In 
order that Wf just depend on the function f restricted to M, it must be the case that two 
functions which are equal on M turn out to have the same intrinsic gradient. Equivalently, 
their difference has zero intrinsic gradient. Equivalently, if f = 0 on M, then Wf =0 on M. 
This seems reasonable, as we believe that if / is a level set of f, then Vf must be orthogonal 
to M; that is, Vf is a scalar multiple of Vg. But then the orthogonal projection of Vf 
orthogonal to Vg is zero: Wf = 0. 

Rather than continuing with this discussion at the present time, we instead turn to some 
wonderful numerical calculations. 


C. Intrinsic critical points 





We continue with the notation and ideas of Section B, so that M is a hypermanifold in R” 
described by an equation g(x) = 0. We say that x € M is an intrinsic critical point of f if 
Vf(x) =0. That is, 

V f(x) ¢ Vg(2) 
IVa(@)I? 
This is a rather daunting equation to solve for x, but it helps to notice that it says precisely 
that V f(x) equals a scalar times Vg(a); for then taking the inner product with Vg(x) shows 
that the scalar must be the one displayed. This scalar 4 is called a Lagrange multiplier for 

the problem. We then are required to solve the equations 





Vile) = Vg(x). 


Vi(t) =AVg(x), 
g(x) =0 (as az € M). 


The “unknowns” are both x and 4. If we count scalar unknowns and equations, we have n+ 1 
unknowns 21,...,%, A, as well as n + 1 equations. Hopeful! 
The equation g(x) = 0 is often referred to as the constraint. 
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As is usual in similar situations, setting up the equations is the easy part. Solving them is 
the hard part, as they are usually nonlinear. Here are some examples. 


EXAMPLE. Find the intrinsic critical points of f(z,y) = x(y — 1) on the unit circle 
gt+y=l. 
Solution. To eliminate some “2’s” we can use g(x,y) = (2? + y? — 1). Then the Lagrange 
formulation is: 

y= 2) = (x,y), 

ety =1. 
Thus y — 1 = Ax and x = Ay. Thus 


y—1 = dy, 
sO 
y(?—-1) = -1 
Thus 
_, = _,. =* 
0 ee 1 
Finally, the constraint gives 
Abt _ 
OF=1F ~ © 
Thus 
M+1 = (7-1)? 
= \*— 2)? +1: 
eee 


A = 0. (0,1); 
= ee 
aa (=4.-4), 
V3 1 
A = =V3: (3-4). 


Here’s a sketch, together with the evident natures of the critical points relative to M: 
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MAX MIN 








Notice, by the way, that (0,1) is actually an ambient critical point of f: Vf(0,1) = (0,0). It 
is the only one. 


PROBLEM 5-14. Just as in Problem 5-8, analyze the function we just studied by 
examining the function cos 6(sin 6 — 1). 


EXAMPLE. Find the intrinsic critical points of f(z,y) = 2° + 8y on the ellipse 
oe ep 
Solution. The Lagrange formulation is: 
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Thus ry = 4/3, so the constraint gives 


11 


xv 1/4 — 
7+3(=) =e 
x 8 
me aa ; 
O7* +32 = 36x’: 
924 — 3627 + 32 = 0; 


(327 — 4)(3a7— 8) = 0; 
oY = hot 878. 


Thus we find four more points: + (/4. 3) and + (,/3. 3). Here is a sketch of the six 


intrinsic critical points: 





GLOBAL MAX LOCAL MIN 
LOCAL MAX 
LOCAL MIN 
LOCAL MAX GLOBAL MIN 





In both of the above examples we drew conclusions about intrinsic extreme values; that 
is, Maximum and minimum values of the relevant function’s restriction to the manifold. We 
shall prove in Section E that such intrinsic extreme values are indeed intrinsic critical points, 


just as in the case of functions defined on 














R” (see Section 2G). 


It is quite interesting to consider the special case of quadratic forms, as we did in proving 


the principal axis theorem in Chapter 4. 


So let A be an n X n symmetric real matrix. We 
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there were analyzing A by means of the Rayleigh quotient 


Arex 
|x|?’ 





and we essentially found its critical points in R". The homogeneity shows that to be the same 
as finding the intrinsic critical points of Av ex on the unit sphere. Thus we ask for points x 
satisfying 

v(Arexz) = 0 and |iz|| = 1. 


The Lagrange formulation gives 


That is, 


Ag = Xz, 
lz|| = 1. 


Thus the intrinsic critical points of Ax ex on the unit sphere are precisely the eigenvectors of 
A! As an example of this procedure, work out the following problem: 


PROBLEM 5-15. Find the intrinsic critical points of (x + y)(y + z) on the unit 
sphere x? + y? +27 =1. 


PROBLEM 5-16. For the Rayleigh quotient function Q(x) = Are z/||x||? show that 
the intrinsic gradient on the unit sphere equals 


vQ=VQ 
= 2Ar —2Arere. 


Here are six more or less routine exercises, followed by six challenging ones. 
PROBLEM 5-17. Use the Lagrange technique to find the points on the parabola 


y? + 2x = 8 which are closest. to the origin. 


PROBLEM 5-18. Find the minimum of 7*+4ary+y’ on the hyperbola 2?—y? = 1. 
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PROBLEM 5-19. Find the minimum distance from (9, 12, —5) to points on the cone 
in R® given by 42? = 2? + y/. 


PROBLEM 5-20. Consider the function f(z, y) = x on the level set y?—x? = 0. Show 
that f attains its minimum value at the origin only. Show that the Lagrange formulation 
fails to produce this result. Explain why. 


PROBLEM 5-21. Repeat the preceding exercise for the function f(x,y) = y on 
the set M : y? = x® + 2°. In this case show also that the level set M is a bona fide 
1-dimensional manifold in R?. 


PROBLEM 5-22. Let a, b, c be positive constants. Find the intrinsic critical points 
or + ‘ + € on the unit sphere et+yt+2=1. 


PROBLEM 5-23. _ Find the intrinsic critical points of 2(a, + v2 + %3)(@1 + %2 + £4) 
on the unit sphere in R’. 


PROBLEM 5-24. Find all the intrinsic critical points of f(x) = x3+23+23+2r1 2973 
on the unit sphere in R®. 


PROBLEM 5-25*. Find all the intrinsic critical points of f(a) = 13 +a23+23-—32,r9%3 
on the unit sphere in R®. 


PROBLEM 5-26**. Let a be an arbitrary but fixed real number. Find all the intrinsic 
critical points of f(a) = x? + 73 + 13 + ax1x2x3 on the unit sphere in R*, and count how 
many there are depending on the value of a. 


PROBLEM 5-27. Find all the intrinsic critical points of f(x) = x? + 73 + x3 on the 
unit sphere in R°. Determine the maximum and minimum values of f on the sphere. 





The next problem is a gem I found in the book Ideals, Varieties, and Algorithms, by Cox, 
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Little, and O’Shea. (David Cox is a 1970 BA Rice, majoring in mathematics, and is now on 
the faculty of the Department of Mathematics, Amherst College). 


PROBLEM 5-28*. Find all the intrinsic critical points of f(x) = 7? + 2x 2973 — x3 
on the unit sphere (there are ten), and the maximum and minimum values as well. 


PROBLEM 5-29*. — Consider the ellipse a + a = 1 where 0 < b < a. For any 
point (z,y) on the ellipse construct the line through that point which is orthogonal to 
the ellipse. This line intersects the ellipse in another point (z’,y’). Let D(x, y) denote 
the distance between (x, y) and (2’,y’). Find the minimum value of this function by first 
showing that 
9(2 ate y? )3/2 
Dia, y) = Sa 2 
a 1 56 
and then using the Lagrange technique to minimize D(z, y). 


(Answer includes D=V27a?b? (a?+b?)~3/? in case a>V2b.) 


D. Explicit description of manifolds 





Frequently hypermanifolds in R” are described in terms of graphs of functions 
R"-1 , R. This means that on the manifold one of the coordinates in R” is expressed 
“explicitly” as a function of the other n — 1 coordinates. As there is no particular coordinate 
to prefer over another, we shall with no loss of generality consider the abstract situation in 
which M is represented by a formula 











Gg = Bison hye i) 


Of eduise, M may then be thought of as represented implicitly in terms of the zero level set 
of R" —4 R, where we simply define the new function g by 








CD). = By =O a va BA): 
The normal vector is then given by 


Vg = (—Dig, ee =D, 1) 
= (-Vy, 1) (for short). 


Notice that Vg is definitely not zero! 
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Now we give an interesting calculation to show what Wf looks like in this framework. We 
shall require only the values of the function f on the manifold M. A convenient way to use 
these values is to define an associated function fy on R"~! by the formula 














fo(21, cara tna) = f(x, ++ 5 Un—-1; y(#1, Bo li a) 


Notice that the new function fp indeed uses only the evaluation of f at points of M. 
Note first that the chain rule implies 


Difo = Dif + Dnf Dey, 1l<k<n-1. 





Using vector notation in R”"?, 


Vifo = (Drfo,---, Dn—1fo) 
SDF cc Deal) + Def Dig s.s Dae): 





Now we simply regard V fp as a vector in R” with n'* component 0, which we write as 
Thus 


V fo = (Dif,---,Dnif, Daf) + Daf (Dig, ---;Dn19,—-1) 
= Vf —D,fVg. 


Now we are all set to compute the intrinsic gradient of f. By definition 





VfeV 
es iar eV 
+ Dy, e 
a! eameuanas |7/aaen, 
= Vi— eae V9~ Pal Vs 


We summarize: 


16 Chapter 5 


THEOREM. In the above contezt, where M is given explicitly 





M ={eeR” | ty = Otero staa) 





we define 
fil Vig cea ne) = Fi wih eal May cacy te) 
Then VfreV 
w= Vos Cy. 


In particular, the intrinsic gradient Wf depends only on the restriction of 
f to M. 


This theorem is of great theoretical importance in that it shows dramatically the intrinsic 
nature of Vf. However, it does not appear to be of any particular use in solving exercises, as 
the Lagrange formulation of the preceding section is indeed quite applicable. 


PROBLEM 5-30. Prove that f has an intrinsic critical point at r <=> fo has a 
critical point at the corresponding point. 


E. Implicit function theorem 


We now turn to a theorem of immense importance in the study of manifolds. It actually 
provides the complete understanding of intrinsic gradients. More than that, it shows that 
hypermanifolds which are described implicitly can also be described explicitly. Thus it is well 
named: THE IMPLICIT FUNCTION THEOREM. 

We do not prove this theorem (one of the “hard” ones) in this course. It is commonly proved 
in beginning courses in mathematical analysis. However, we very much need to understand 
exactly what it does (and does not) say. 

Suppose then that the hypermanifold M C R” is described implicitly in a neighborhood 
of 2) € M by the equation 














g(x) = 0. 


As usual, R” —> R is assumed to be of class C', and Vg(x) 4 0 for all x in a neighborhood 
of x9. We want to describe M explicitly near x9, so what we need to do is solve the equation 
g(x) = 0 for one of the variables in terms of the others. Say we succeed in solving for x, as a 
function of %1,...,2%,—1: in a neighborhood of x9 we then have a situation 























ge) = 0. SS By — as oc nmi 





for some function y defined on a neighborhood in R™?. 
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It is surely reasonable to expect that we need a condition like 0g/0x,, 4 0 in order to do this. 
At the very least, we need g(x1,...,2%n) to involve x, in a significant way. Furthermore, the 
manifold rz, = y(11,...,%n—1) has the normal vector (—Vy,1) with nonzero n“ coordinate, 
and thus the normal vector Vg should also have a nonzero n™ coordinate. 

As a simple example take g(x) = ||z||? — 1, so that we are dealing with the all-important 
unit sphere as M. Then Vg(x) = 22. Solving g(x) = 0 goes something like this: 


p= 1-7 -+s-—g 





| 
He 
os 
| 
8 
Hho 
| 
| 
8 
3 
u 


Ln 


The choice of sign gives the “upper” or “lower” “hemisphere.” 








We see one thing immediately: we cannot in general expect the explicit presentation of M to 
be anything but local. We also are going to want y to belong to class C! so that we can do 
calculus. In this example this C+ quality will not happen if x? +---+22_, = 1 (the “equator” 
o,, =). 

Now suppose % € $(0,1) and its n coordinate x, 4 0. Then Vg(xo) = 2% # 0. 
Furthermore, we can solve for x, near 2. If 2, > 0, we obtain the + sign above, and the 
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reverse if x, < 0. Thus in the latter case x,, < 0, we have 





i = -l-a}---- a2, for apts ta, <1 


Of course, the entire sphere can be handled in a similar way since Vg(ao) = 229 4 0 requires 
some coordinate x,; of 2) to be nonzero, and we can then solve for x; locally: 





= 2 2 2 2 
= [Vata ee a2 ah 02, 


The above simple example of the sphere is completely typical. Here is the result. 











IMPLICIT FUNCTION THEOREM. Suppose the hypermanifold M C R” is described 
as the level set 





g(x) = 0, 
where R" —> R is of class C! and Vg 40 on M. Suppose xp € M. Suppose that 0g/Ox;(xo) # 


0. Then there exists R°-' + R of class C! such that for all x in a sufficiently small 
neighborhood of xo, 

















Oe 0. SS pS Oi ae eg ei): 





Though this is an existence theorem and as such does not provide a clue about the actual 
calculation of y, the chain rule gives “explicit” formulas for the partial derivatives of y. For 
we may start with the functional identity in the variables 71,...,%;-1, Vi44,---,%n:! 


Og se A Oca hy) =O. 

Now differentiate this identity with respect to x; for any 7 #7. The chain rule implies 
Therefore we conclude that 
Dig 
On the right side of the latter equation, the partial derivatives of g are evaluated at x; = y. 
Notice the appearance of the nonzero quantity Djg in the denominator. 

There is a nice moral to get from all of this. Solving the equation g(x) = 0 for x; in terms 
of the other coordinates is likely to be a very difficult task. But once that has been done and 


the above function y has been produced, the calculation of the partial derivatives D;p is very 
simple. In fact, it’s a linear task. You have seen this sort of “implicit differentiation” in your 





Dyp = - 
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introductory calculus courses. For instance there are lots of exercises of the following nature: 
the equation 

ne +siny —y—2? =0 
is satisfied by a function y = y(x) near x = 0, and y(0) = 7. Compute dy/dx. The solution 
is obtained by performing d/dz: 


dy dy dy 
TY | go + “~—-—“-27=0. 
“ (: dx v) BONY ee dx - 
Then solve: 
dy 2a — mye™¥ 





dx  mxe™ + cosy — 1 
How simple. (Never mind that we don’t really “know” the terms y = y(x) on the right side.) 
Notice that when x = 0 and y = 7, the denominator equals —2 and this is not 0. In particular, 


dy ‘i 


dx = 2 : 

Another nice result of the implicit function theorem is the proof that the intrinsic gradient 
vf depends only on M and not on the particular function g whose level set is equal to M. 
This is clear once we check that Vg is uniquely determined by M, up to a nonzero scalar 
multiple; for the formula for Wf shows that the scalar multiple cancels out of the equation. 
More geometrically, Wf is just the vector Vf with the correct multiple of Vg added so that 
the resulting vector is orthogonal to Vg; this makes it clear that only the direction of Vg is 
needed. 





THEOREM. Given a hypermanifold M C R”, suppose it is described as the level set {x € 
R” | g(x) = 0}, where Vg £0. Then Vg is uniquely determined by M, up to a nonzero scalar 
multiple (which may be a function of x). 





PROOF. Suppose for instance that D,g(xo) 4 0. Then the implicit function theorem yields 
a function R"-! —*> R such that near x9 we have 

















eM SS Gy = Oly oe Si): 
Then as above we obtain on M 
D,g + DrgD;y = 0, 1<j7<n-1. 


We conclude 
Vg = Drg(-Ve, 1). 
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If another function g also gives the manifold, and Vg(xo) 4 0, then 9(%1,..-,%n_1, 9(41,---;2n-1)) = 
0 so that the chain rule again gives 


Vg = Drg(-V¢, 1). 


Thus D,,g(x) #0 and Vg(x) =scalar times Vg(z). 
QED 
We can now clear up an issue that we have been ignoring, thanks to the fact that we 
understand that the intrinsic gradient Wf is completely determined by the restriction of the 
function f to the manifold in question. The issue is this: suppose f attains a local maximum 
or minimum value at 79 € M relative to the restriction of f to M. Then we want to know 
that necessarily 29 is an intrinsic critical point of f. Here’s the result: 














THEOREM. Suppose M is a hypermanifold in R” and suppose R” LRisac function 
defined in a neighborhood of a point x9 € M. Suppose that f(x) < f(xo) for alla € M 
belonging to some neighborhood of x9. Then Wf (xo) = 0. 


PROOF. Thanks to the implicit function theorem, we know that M can be represented 
explicitly near 2) by an equation of the form 


ip Tyce fe 4) 


(we have named 2, as the distinguished coordinate for simplicity of writing only). We use the 
function fp of Section D, 


fo(x1, ee »En-1) = f(a, aa »tn—-1, P(e, ats Meat), 


Let x) = (%o1,---;%0n—-1)- Then our hypothesis means precisely that 


fo(z’) < fol(xo) 





for all 2’ € R"! sufficiently near x). Thus 2% is a critical point for the function fo, and we 
conclude that its gradient V fo(a)) = 0. But then the theorem on p. 5-15 yields 
Vioe Ve 


Vf (xo) = Vfo+ vole +1 


(-V¢, 1) 
=i, 


QED 
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REMARKS. Of course, the conclusion still holds if we are dealing with a local minimum 
instead: f(x) > f(xo) for all 2 € M sufficiently near xo. In the next section we shall give a 
somewhat different proof of this result that is even more intrinsic. And in Section G we shall 
learn that the theorem remains valid for manifolds M Cc R” of any dimension, not just n — 1. 














PROBLEM 5-31. THE ARITHMETIC-GEOMETRIC MEAN INEQUALITY. 
Using the following outline, prove that for any 7; > 0,...,%, > 0, 
um eos 








and that equality holds => 7, =---= Zp. 
a. Prove first that you may assume that 7, +---+2%, =1. 


b. Show that the function f = 71...2, restricted to the set 71 +---+2%, =1, 4; >0 
for all 7, attains its maximum value at some Zp. 


c. Show that all the coordinates of this point x9 are positive. 


d. Use the Lagrange technique to determine po. 


PROBLEM 5-32. Find the minimum of x; r2...2,, subject to the constraint 


1 1 1 
Pe ec] = 1, all x; > 0. 
Uy v9 In 





F. The tangent space 


Now we are going to face the problem of actually defining tangent vectors to a manifold. 
Suppose that M C R” is a manifold, not necessarily a hypermanifold. Suppose 7» € M. We 
want to give a definition of tangent vectors to M at x» that is as intrinsic to M as possible 
(how the “inhabitants” of M view tangent vectors). We shall accomplish this by focusing 
attention on curves which lie in the manifold. 








DEFINITION. In the above situation consider all curves (see p. 2-3) y from R to R" which 
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satisfy the following: 


y(t) € M for all t, 


7(0) = Xo, 
y is of class C’. 





Then the velocity vector 7/(0) is called a tangent vector to M at xp. Notice that y'(0) € R”. 
The set of all such vectors is called the tangent space to M at x9, and is written 


Te 
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PROBLEM 5-33. Prove that 0 € T,,M and that if h € T,,M and c € R, then 
Cle 14M. 
(HINT: 7¥(ct).) 














We anticipate that if MM is an m-dimensional manifold, then T/,,.M is an m-dimensional sub- 
space of IR”. The preceding problem indeed shows that scalar multiples of tangent vectors 
at 9 are themselves tangent vectors at 29, but no such simple technique will handle sums of 
tangent vectors. We shall prove all these results when we come to Chapter 6. (Though T,, 
is a subspace of IR” and contains the origin, we always imagine this subspace attached to M 
at the point xo, with the origin in T,,,M being regarded as located at 29.) 

For the present time we continue to focus our attention on the hypermanifold case with a 
theorem that reveals all we really need to know about T°, in this case. 






































THEOREM. Let M be a hypermanifold in IR", described implicitly as a level set 


where g is of class C! and Vg #0 on M. Let xy € M. Then 











Tr M = {he R" | Vg(to)eh = Of. 

















In particular, T,,M is an (n — 1)-dimensional subspace of R”. 
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PROOF. First, suppose h € T,,/. Use a curve ¥ as in the definition above, with 7/(0) = h. 
Then since y(t) € M, 


g(y(t)) = 9. 


Computing the ¢ derivative and using the chain rule, 


Va(y(t))e v(t) = 0. 





Setting t = 0, 
Vgo(to)eh = 0. 
Conversely, suppose h € R” satisfies 
Va(ro)eh = 0. 


Now we have to do something quite significant. We are required to produce a curve in M 
with all the right properties. Since we don’t even know how to produce individual points in 
M, much less a whole curve, we need a theorem of some sort. The implicit function theorem 
serves the purpose perfectly. For ease in writing let us suppose D,,g(xo) 4 0. Then we know 
(thanks to the implicit function theorem) that M can be described explicitly as a graph 


To SG Dig <n BA) 
near Zp. Write x = (201,---,Zon). We then define a curve y(t) by making it affine in the 
“independent” coordinates %1,...,2%,—1 in the following way: 


“h) = te Pat, lag sn—1 
Y(t) = p(n(t),---,%1(4))- 


From the chain rule and the formula on p. 5-18 we obtain 


n-1 
y(0) = S- Djy (wor,.--,Con—1) 74 (0) 
j=l 
n-1 
_ \- _ D59(xo) a 
=i Dng(Zo) : 


But also we have 
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that is, 
n-1 
a Djg9(xo)hj + Dng(£o)hn = 0. 
j=l 

Thus 


Yn(0) = fn. 
This proves that y'(0) = h, as desired. 
QED 
As a nice bonus, we can now easily give a complete understanding of the intrinsic gradient 

















of a function, relative to the hypermanifold WM. Suppose first that IR” , R is of class C! near 
a point x9 € M. Suppose that h € T;,,M. There are now two ways to view this situation. 


(1) We use Vg(xo) eh = 0 and the definition from p. 5-5, 


V f (xo) ¢ Vg(o) 
|[Vg(xo) ||? 





Vf(xo) = VFf(r0) — Vg(Xo); 


to conclude that 


Vf(to)eh = Vf(xo) eh 
= Df(x0o;h). 


Remember from p. 2-14 that Df(ao;h) is our notation for the directional derivative 
of f at xo in the direction h. Thus Wf(zo) is the unique vector in T;,,.M whose inner 
product with every h € T,,M equals Df (xo; h). 


(2) Consider an arbitrary curve y in M such that y(0) = xo and y'(0) = h. Then the chain 


rule gives 

< f (y(t) = VE (r(t)) © 7'@), 
so that 

d 

FZ) | _, = Vso) © 


h 
= Vf (xo) oh. 


It is this second relationship that is so intriguing, since the function f o y depends on the 
behavior of f only on the manifold M and not on the ambient R”. We can therefore extend 
the definition of p. 5-5 as in the theorem we are preparing to consider. 
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But before we state the theorem, we need to explain part of the hypothesis. Namely, we 





are going to assume that M , Ris of class Cl. Since f might be defined only on M, it is not 
immediately clear how to define this continuous differentiability. In fact, a moment’s thought 
might lead to two competing ideas: 


(1) Representing M in an explicit manner such as 
Ln = P(X1,---;Fn-1); 
require the resulting function 


fol Rig<ceg gt = T Sie ey ae ys a2 ye) 





to be of class C' on (a neighborhood in) R”. 


(2) Require that there exist a C1! function R” —, R such that F (x) = f(x) for € M (in 


a neighborhood of some point). 








PROBLEM 5-34. Prove that these two definitions are equivalent. 








THEOREM. Let M be a hypermanifold in R" and x € M. Let M LRbeal! function 
defined only on M. Then there exists a unique vector Wf(xo) in T,,M such that for all C! 
curves y in M such that y(0) = xo, 


FPO) |_, = Velo) #7). 


t=0 


DEFINITION. The tangent vector Wf(zo) is called the intrinsic gradient of f at xo. 
Because of the discussion right before the theorem, it agrees with the definition given on 
p. 5-5 in case f is defined in a neighborhood of xp in R”. 

PROOF. Use the second of the two definitions of C' given above. Thus in a neighborhood 
of x9 there exists some C! function R” —. R which agrees with f on M. Then we simply 
compute 




















“tott) | = SFO) 


t=0 
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thanks to the known properties of the intrinsic gradient WF'(xq) of the ambient function F’. 
This finishes the existence part of the proof, as we may simply define Wf(2o) = WF(2o). 

The uniqueness is a separate argument. If there were two vectors v and w € T,,,M fulfilling 
the conclusion of the theorem, then we would have 


vey (0) = wey‘(0) for all curves ¥. 


Thus, 
(v—-w)eh = 0 for all h € T,,M. 


Since v — w € T;,,.M, we conclude that v — w = 0. Thus v = w. 
QED 


REMARK. Given a C' function M 4 R, there are many ways to extend it to R” Rin 


a neighborhood of a. Each such F has a gradient VF'(x9), but the intrinsic gradient W F(x) 
is independent of the choice of the extension F’. Our results show that each WF‘) is just 
equal to Wf(xzo). Thus we have a very practical “algorithm” for computing Wf. Namely, first 
extend f to an ambient C+ function F; second, compute WF’. The resulting vector is precisely 
vf. 

The above results are in agreement with what we accomplished in the theorem on p. 5-15, 
where we noticed that the intrinsic gradient of a function depends only on its restriction to 
the manifold. The extra information we now have comes from the intrinsic understanding of 
tangent vectors themselves. 











G. Manifolds that are not hyper 


In this section we want to move away from the restriction that MM C R” has dimension 
n —1. Thus we shall study manifolds of dimension m contained in R", where 1 << m<n-—1. 
This range of dimension covers all that we are really concerned with in the present chapter, 
going from m = 1 (“curves”) to m = 2 (“surfaces”) on up tom =n —1 (hypermanifolds). We 
do not deal with m = 0, as 0-dimensional “manifolds” would just consist of isolated points in 
R”, and no actual calculus could be done. At the other extreme, m =n, we would be talking 
about n-dimensional manifolds contained in R” and these are simply open subsets of R”. This 
leads essentially to the “flat” calculus we have studied in detail in Chapters 2—4 and presents 
no new ideas at the present time. 

Again we shall use both an implicit presentation and an explicit presentation of M. In 
addition we shall also consider a third method, a parametric presentation. Here is a preliminary 
summary: 





























IMPLICIT M is defined by n — m constraints placed on the points of R”. 
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EXPLICIT M is defined by giving n — m coordinates of R” as explicit 
functions of the other m coordinates. 














PARAMETRIC M is defined by describing its points as a function of m 
other real variables (called parameters). 








We devote the rest of this section to the discussion of three examples which happen to be 
quite interesting manifolds. 


EXAMPLE 1. M is the ellipse st + 4 = 1 in the x7, — 22 plane, but thought of as lying in 
R°. (The semiaxes a and 6 are arbitrary positive numbers.) As M is surely a 1-dimensional 
manifold, we have m = 1 and n= 3. 








Implicitly, M = {c € R? | 44+ 4 = 1,25 =o}. 


Explicitly, we “solve” the constraint equations near a point of M; for instance, near the 
point (—a,0,0) we can write 


Parametrically, we can express 
M = {(acost, bsint,0) |0<t< 27}; 


thus all three coordinates are given as functions of the single parameter t. 


PROBLEM 5-35. Given the ellipse above, suppose 0 < b«< a. 
Through any point on the (0, b) 
ellipse draw the straight line 
which is orthogonal to the ellipse. 
This straight line contains a point 
which is closest to the origin; let 





(-a, 0) (a, 0) 











D be the corresponding distance between 
this point and the origin. Find a point on 
the ellipse which produces the maximum D, 
and compute this maximum value. 


(0, —b) 
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PROBLEM 5-36. Continue with the same ellipse. 
Through any point on it and in 

the first quadrant, draw the straight 

line which is tangent to the ellipse. This 


line contains a | 


points lie on the coordinate axes. Let 


L be the length 


point on the ellipse which produces the 
minimum L, and compute this minimum value. 
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ine segment whose end 





of this segment. Find a 





PROBLEM 5-37. In the preceding problem calculate the lengths of the two indicated 


line segments w 


hich constitute the optimal L, and show that they are a and b. (Do you 


see a simple geometric reason for this outcome?) 


EXAMPLE 2. 


an axis disjoint fro 


The circle in the x 





Torus of revolution. This is obtained by revolving a circle in R® about 
m it. We arrange things as in the illustration: 


— z plane is 


described as (x — b)? + z? = a’, 


where 0 <a < b. 


Now revolve th 
points (2’,y’,z) €] 








is around the z-axis. Starting with a point (z,0,z) on the circle produces 
R° for which ,/z’? + y’2 = x. Thus the equation of the resulting surface is 


2 
(V2 + y? — b) og? ag” Implicit presentation. 
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Of course, m = 2 and n = 3. 


We can also give a parametric presentation using two angles. The first angle w~ can 
parametrize the circle in the x — z plane in the usual polar coordinate way: + = b+ acosy, 
z = asinyw. To get the revolved surface, we leave z alone, but use b+ acosw as the dis- 
tance from the z-axis. If we have revolved through the angle 6, the x, y coordinates are then 
x = (b+acosy) cos, y = (b+ acosw)sin@. Thus we have 


x =(b+acosi~)cos@, 
y =(b+acosy)sin6, Parametric presentation. 


z =asinw. 
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PROBLEM 5-38. Consider the given implicit presentation of the torus of revolution. 
The point (b + a,0,0) lies on this torus. Show that near this point the torus has the 


explicit presentation 
2 
c= 1 a? eo y?. 


Show that the maximal region in the y — z plane for which this representation is valid has 
the shape: 








(0, & ) semicircle 


Le 
. e ( y,0) 
(0, 0) (B.0) 


What are a, @, and y? 








In a certain sense this torus of revolution can be thought of as the Cartesian product of two 
circles, as two independent periodic coordinates w, # are used in its presentation. However, it 
is not really a Cartesian product. There does exist a very interesting surface, a 2-dimensional 
manifold, which is actually the Cartesian product of two circles. As each circle is contained in 
R?, the Cartesian product we are going to exhibit is contained in R? x R? = R*. This manifold 
is often called a flat torus: 









































EXAMPLE 3. _ Just as R* is the Cartesian product R? x R? of two planes, this 2-dimensional 
manifold M is literally the Cartesian product of two circles: 





M = {(21,22,%3,04) | v?+23=1, 23+2}7=1}. Implicit presentation. 


We can use the polar coordinates for the two unit circles to write the points of M in the 
form 


XL, = cosh, 
Ly =sinh, Parametric 
%3 = cosa, presentation. 


4 =sin Gd. 
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In a very definite sense to be explained later, this manifold is flat, unlike the torus of revolution. 

At any fixed point x € M, we can find two linearly independent vectors orthogonal to M, 
using the implicit presentation; and two linearly independent vectors tangent to MW, using the 
parametric presentation. Namely, orthogonal vectors can be found by using the gradients of 
the two defining functions: 


(%1,22,0,0) and (0,0, 23, 24). 


And tangent vectors can be found by using the partial derivatives 02/00; of the parametriza- 
tions: 
(—sin@),cos@,,0,0) and (0,0, —sin 62, cos 62). 


Thus we can split R* into the 2-dimensional space orthogonal to M at x plus the 2-dimensional 
space tangent to M at x. We have just found the relevant orthonormal sequence: 


(X11, XQ; 0, 0) 
(0, 0, 73, v4) orthogonal to M, 





(23, 7, 0, 0) 
(0, 0, —24, £3) tangent to M. 





Incidentally, we draw no pictures of the flat torus. It cannot be located in R*®, whereas the 
torus of revolution is a hypermanifold in R*. Thus it gives us an interesting example of a 
2-dimensional manifold in R* which is not a hypersurface in R?. 

These two manifolds are “topologically” indistinguishable. Without pausing to define the 
adjective, we go ahead and display a function 




















flat torus 4 torus of revolution 
in a rather obvious fashion. Namely, 
f(@1,@2,03,04) = ((b+ ax3)a1, (b+ ax3)a2, ax). 


PROBLEM 5-39. Prove that f is a continuous bijection of the flat torus onto the 
torus of revolution. Prove that its inverse is given as 


rin) = (agp age SE 


Vety? Je? t+y? a 








and that f~ is also continuous. 
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Thus this function f provides a one-to-one correspondence between the points of these two tori, 
with the property that f and f~! are both continuous. More than that, f and f~! are both 
infinitely differentiable. Thus the flat torus and the “round” torus cannot be distinguished in 
the sense of manifolds. 

However, these tori are quite different geometrically. The round one is really a curved 
surface, and the surface appears quite different at different points. However, though we are as 
yet not equipped to define the adjective “flat,” it is rather clear that all points of the flat torus 
look alike from a geometric perspective. Thus the two tori are distinct geometric objects. 


PROBLEM 5-40. Prove that at any point of the torus of revolution the vectors 


(—sinwcosé, —sinwsind, cosw), 
(—sin@, cos@, 0), 


(cos w cos 6, coswsin 6, sinw), 











form an orthonormal basis of R®, the first two being tangent to the torus and the third 
orthogonal to the torus. 





